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Abstract 



In this paper we use a hybrid Monte Carlo-Optimal quantization method to 
approximate the conditional survival probabilities of a firm, given a structural 
model for its credit default, under partial information. 

We consider the case when the firm's value is a non-observable stochastic 
process (Vt) t>0 an d investors in the market have access to a process (St) t>a , 
whose value at each time t is related to (V s ,s < t). We are interested in 
the computation of the conditional survival probabilities of the firm given the 
"investor information" . 

As an application, we analyze the shape of the credit spread curve for zero 
coupon bonds in two examples. 

Keywords: credit risk, structural approach, survival probability, partial informa- 
tion, filtering, optimal quantization, Monte Carlo method. 
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Introduction 



In this paper we compute the conditional survival probabilities of a firm, in a market 
that is not transparent to bond investors, by using both Monte Carlo and optimal 
quantization methods. This allows us to analyze the credit spread curve under partial 
information in some examples, in order to investigate the degree of transparency and 
riskiness of a firm, as viewed by bond-market participants. 

To introduce the problem, recall that most of the bonds traded in the market are 
corporate bonds and treasury bonds, that are consequently subject to many kinds 
of risks, such as market risk (due for example to changes in the interest rate), coun- 
terparty risk and liquidity risk. One of the main challenges in credit risk modeling 
is, then, to quantify the risk associated to these financial instruments. 

The methodology for modeling a credit event can be split into two main ap- 
proaches: the structural approach, introduced by Merton in 1974 and the reduced 
form approach (or "intensity based"), originally developed by Jarrow and Turnbull 
in 1992. 

The structural approach consists in modeling the credit event as the first hitting 
time of a barrier by the firm value process. 

In reduced form models the default intensity is directly modeled and it is given 
by a function of latent state variables or predictors of default. 

The first approach, in which we are interested, is intuitive by the economic point 
of view, but it presents some drawbacks: the firm value process can not be easily 
observed in practice, since it is not a tradeable security, and a continuous firm's 
value process implies a predictable credit event, leading to unnatural and undesirable 
features, such as null spreads for surviving firms for short maturities. 

Despite the apparent difference between the two models (see, e.g., Jarrow and 
Protter, 2004), some recent results, starting from the seminal paper Duflie and Lando 
(2001), have unified the two approaches by means of information reduction. We 
consider, then, a structural model under partial information, in which investors can 
not observe the firm value process, but they have access to another process whose 
value is related to the firm value process. We show in two examples that yield spreads 
for surviving firms are strictly positive at zero maturity, since investors are uncertain 
about the nearness of the current firm value to the trigger level at which the firm 
would declare default. The shape of the term structure of credit spreads may be 
useful, then, in practice to estimate the degree of transparency and of riskiness of a 
firm, from the investors' point of view. 

We show that the computation of the conditional survival probabilities under 
partial information leads to a nonlinear filtering problem involving the conditional 
survival probabilities under full information. These former quantities are approxima- 
ted by a Monte Carlo procedure, while the filter distribution is estimated by optimal 
quantization. 

The paper is organized as follows. In the first section, we present the market 
model and we decompose our problem into two problems (PI) and (P2), that are, 
respectively, the computation of conditional survival probability in a full information 
setting and the approximation of the filter distribution. Section [2] and Section [3] are 
devoted to the solution of the previous two problems. We provide error estimates 
in Section [4] and, finally, in Section [5] we present two numerical examples concerning 
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the application to credit risk. 



1 Market model and problem definition 

Let us consider a probability space (f2, T, P), representing all the randomness of our 
economic context. 

For the moment we concentrate our attention on the "real world" probability measure 
P and on a single firm model, in which the company is subject to default risk and 
we use a structural approach to characterize the default time. 

The process representing the value of the firm, given for example by its value of 
financial statement, is denoted by (Vt) t>0 and we suppose it can be modeled as the 
solution of the following stochastic differential equation 

f dV t = b(t,V t )dt + a(t,V t )dW t , 

\ V = vo, 1 • > 

where the functions b : [0, +oo) x R — > R and a : [0, +oo) xl^l are Lipschitz in x 
uniformly in t and If is a standard one-dimensional Brownian motion. We suppose 
that a(t, x) > for every (t, x) G [0, +oo) x R. 

In our setting the process V is non observable (it is also known as state or signal), 
but investors have access to the values of another stochastic process S, providing 
noisy information about the value of the firm, that can be thought, for example, as 
the price of an asset issued by the firm. 
This observation process follows a diffusion of the type 

| dS t = S t [ip(V t )dt + v(t)dW t + 5(t)dW t ] , 
\ So = s , 

where t/j is locally bounded and Lipschitz, v and 5 are deterministic functions and W 
is a one-dimensional Brownian motion independent of W. Note that in this model 
the return on S is a (nonlinear) function of V . 

Finally, following a structural approach, we define the default of the company as 

r := inf {* > : V t < a}, (1.3) 

where as usual inf = +oo and for a given constant parameter a £ R, < a < vo- 

In numerical examples we will consider models where Vt £ (0, +oo) (eventually 
by stopping the process V at the default time r by considering the process (Vt/\ T ) t ). 

We will deal with two different nitrations, representing different levels of in- 
formation available to agents in the market and we suppose that they satisfy the 
usual hypotheses: a filtered probability space (f2, J 7 , (J 7 t) t>0 , P) satisfies the usual 
hypotheses if J-q contains all P-null sets and if the filtration is right-continuous. 

The first and basic information set is the "default-free" filtration, the one gene- 
rated by the observation process S, which we will denote, for each t > 0, 

■= a(S s ,0< s<t) 

and the second one is the full information filtration (£t) t > , i-e., the information 
available for example to a small number of stock holders of the company, who have 
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access to S and V at each time t. In our case, the full information filtration is the 
one generated by the stochastic pair process (W, W). In conclusion we have 

ff £Gt, V t > 0, 

and we observe that the following immersion property holds (see for example Coculescu, 
Geman and Jeanblanc, 2008 for an analogous analysis): 

Lemma 1.1. Any (J-f) t -local martingale is a (Qt) t -local martingale. We will say, 
then, that filtration (J-f) t is immersed in the full filtration {Gt)f 

For a fixed s > we observe the process S from to s and we suppose that the 
default r occurs after s, so that, 

r = inf{t > s : V t < a}. 

Suppose now that a finite time horizon T is fixed. For a given t, such that s < t < T, 
our aim is to compute the following key quantity 
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{r> s y 



inf V u > a 

s<u<t 



(1.4) 



We will see in the following Section [5] how this quantity plays a fundamental role 
(if computed under a pricing measure) in the computation of credit spreads for zero 
coupon bonds. 

1.1 Reduction to a nonlinear filtering problem 

Using the law of iterated conditional expectations, the Markov property of V and 



the independence between W and W, we find, for each (s, t) £ 



~,s<t, 



inf V u > a 

s<u<t 



u a 



E 



E 



p 


( inf 




> a 




J s 




\s<u<t 






p 


( inf 


Vu 


> a 


v s ) 


F s 

J s 




\s<u<t 







E[F(s,t,V s )\^ 



-a.s. 



where, for every x £ 



F(s,t,x) :-- 



inf V u > a 

s<u<t 



V 



(1.5) 
(1.6) 



Finally, 

(PI) if we compute F(s,t,x) for every x S R, which is now a conditional survival 
probability given the full filtration, and 

(P2) if we obtain the filter distribution at time s, i.e., the conditional distribution 
of V s given J=f , Il^^s, 

then we are done, since it suffices to compute the integral 
E[F(s,t,V s )\^] = 



F(s,t,x)U Vsl:F s(dx) 

) 

poo 

/ F(s,t,x)H Vs]:F s(dx). 

J a 



It remains to solve the two "intermediate problems" (PI) and (P2). Let us consider 
first problem (P2). 
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2 Approximation of the filter by optimal quantization 



Let us recall first some facts about optimal vector quantization. 
2.1 A brief overview on optimal quantization 

Consider an Revalued random variable X denned on a probability space (Q,,A,F) 
with finite r-th moment and probability distribution Fx- Quantizing X on a given 
grid T = {x 1 , • • • ,x N } consists in projecting X on the grid T following the closest 
neighbor rule. The induced mean L r -error 

\\X - Proj r (X)\\ r = ll^minjX-x^llr 

is called the U ' -mean quantization error and the projection of X on T, Proj r (X), is 
called the quantization of X. As a function of the grid T the L r -mean quantization 
error is continuous and reaches a minimum over all the grids with size at most N. 
A grid r* minimizing the L r -mean quantization error over all the grids with size at 
most N is called an L r -optimal quantizer. 

Moreover, the L r -mean quantization error goes to as the grid size N — ► +oo 
and the convergence rate is ruled by Zador theorem: 

min \\X - Proj r (A)L = Q r (F x )N' 1/d + o(A~ 1/(i ) 
r, \t\=n v ' 

where Q r (Fx) is a nonnegative constant. We shall say no more about the basic 
results on optimal vector quantization. For a complete background on this field we 
refer to Graf and Luschgy (2000). 

The first application of optimal quantization method to numerical probability 
appears in Pages (1997). It consists in estimating Ef(X) (it may also be a conditional 
expectation) by 

N 

Ef{Proj r *(X))=J2f(x*' i )Pi (2-1) 
i=i 

where F* = {x*' 1 , ■ ■ ■ ,x*' N } is an L r -optimal grid for X and pi = P(Proj r *(X) = 
x*' 1 ) . The induced quantization error estimate depends on the regularity of the func- 
tion /. 

• If / : W 1 M. is Lipschitz continuous and r > 2, then 

|E/(A)-E/(Proj r .(A))| < E\f(X) - /(Proj r *(X)) | 

< [/] Lip ||X-Proj r *(X)||i 

< [/] Lip ||X-Proj r ,(A)|| 2 . 

• If the derivative Df of / is Lipschitz and r > 2, then, for any optimal grid T*, we 
have 

|E/(A)-E/(Proj r ,(A))| < [Df] Up \\X - Proj r *(X)|||. 

How to compute numerically quadratic optimal quantizers or L r -optimal (or 
stationary) quantizers in general, the associated weights and L r -mean quantization 
errors is an important issue from the numerical point of view. Several algorithms 
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are used in practice. In the one dimensional framework, the L r -optimal quantizers 
are unique up to the grid size as soon as the density of X is strictly log-concave. In 
this case the Newton algorithm is a commonly used algorithm to carry out the L r - 
optimal quantizers when closed or semi-closed formulas are available for the gradient 
and the hessian matrix. 

When the dimension d is greater than 2, on the contrary, the //-optimal grids 
are not uniquely determined and all //-optimal quantizers search algorithms are 
based on zero search recursive procedures like Lloyd's I algorithms (or generalized 
Lloyd's I algorithms which are the natural extension of the quadratic case), the 
Competitive Learning Vector Quantization (CLVQ) algorithm (see Gersho and Gray, 
1992), stochastic algorithms (see Pages, 2008, and Pages and Printems, 2003), etc. 
From now on we consider quadratic optimal quantizers. 

2.2 Estimation of the filter 

We focus now on problem (P2) and we present here a solution based on optimal 
quantization, as suggested in Pages and Pham (2005). 

For an overview on nonlinear filtering problems in interest rate and credit risk mo- 
dels we refer to Frey and Runggaldier (2009) and references therein and, focusing on 
filtering theory in credit risk, we also have to mention the seminal papers Kusuoka 
(1999) and Nakagawa (2001). 

We consider a general discrete time setting, in which we recall the relevant for- 
mulas and the desired approximation of the filter (see, e.g., Pages and Pham, 2005 
and Pham, Runggaldier and Sellami, 2005, for a detailed background). We introduce 
a probability space (Q,.4, P) (notice that P is not the same measure we considered 
in Section [TJ but for simplicity we will use the same notation) and we suppose that: 

• the signal process (X k ) keN is a finite-state Markov chain taking values in 
the space E, with known probability transition, from time k — 1 to time k, 
Pk(x k —i, dxfc), k > 1, and given initial law fj,, and 

• the observation process is an Revalued process (Y/%) fcgN such that Yq = yo and 
the pair (X k ,Yk) k ^N is a Markov chain. 

Furthermore, we suppose that for all k > 1 

(H) the law of Yfc conditional on (X k _i, Y k _i, X k ) admits a density 

Vk >-> 5fc(-Xfc_i, Y fc _i, X k ,y k ), 

so that the probability transition of the Markov chain (X&, Yfc) fceN is given by 
Pk(xk-l,^k)9k(x k -i,y k -i,x k ,y k )dy k , with initial law n(dx )5 (dy ). 

In this discrete time setting we are interested in computing conditional expecta- 
tions of the form 

ny, n / := E | Yi,...^], 

for suitable functions / defined on E, i.e., we are interested in computing at some 
time n the law Ily in of X n given the past observation Y = (Yl, . . . , Y n ). Having fixed 
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the observation Y = (Y±, . . . , Y n ) = (y±, . . . , y„) =: y we will write tly, n instead of 

ny 5 „. 

It is evident that in the case when the state space of the signal consists of a 
finite number of points, the filter is characterized by a finite-dimensional vector: if 
for example each X k takes values in a set {x\, . . . ,x k k } (as in the case where we 
quantize a process X at discrete times t k ,k = 0, - ■ ■ , n with grids of size N k ), then 
the discrete time filter distribution will be fully determined by the iV^-vector with 
components 

JP Ytk = F(X k = x) e \Y 1 ,...,Y k ), i = l,...,N k . 

It is for this reason that, following Pages and Pham (2005), we apply optimal quanti- 
zation results in order to obtain a spatial discretization, on a grid T k = {x k , . . . , x k h }, 
of the state X k , k = 0, . . . , n, and to characterize the filter distribution by means of 
the finite number of points {xo,x}, . . . ,x^ l ,x\, . . . ,x^ 2 , . . . ,x„, . . . ,x^ n } making up 
the grids {T k ) k . 

In what follows we recall the basic recursive filtering equation, that we will use 
in our numerics to approximate the filter. By applying the Markov property of X 
and (X, Y) and Bayes' formula, we find: 

IW = ^4. ( 2 - 2 ) 
where ir y ^ n is the un-normalized filter, defined by 

%,n/ = / "•• / f( x n)Kdxo)Y[gk(xk-i,yk-i,Xk,yk)Pk(x k - 1 ,dx k ). (2.3) 
J J k=l 

Equivalently, we recall the following recursive formula, that can be directly obtained 
as well by applying Bayes' formula and the Markov property: 

n^dxfc) oc J g fc (x fc _i,y fc _i,a; fe ,y fc )P fc (a; fc _i,da; fc )IIj /jfc _i(da; fe _i), 

where now y in H yk _i represents the realization of the vector {Y\, ■ ■ ■ , Y k _i) and we 
do not have equality because we need to re-normalize. 
Now for any k £ {1, • • • , n} note that 

k 

n y ,kf = E(j(X k ) Y[9i( x i-l,yi-l, x i,Vij) ■ 
i=l 

Therefore, introducing the natural filtration of X, {F k ) ke jq, we have 

= E ^E (f(X k )g k (X k _i,y k ^i,X k ,y k )\J 7 k x _ 1 ) Y[gi( x i-i,yi-i, x i,yi)j 
= E ^Hy )k (f(X k _i))J^gi(Xi-i,yi-i,Xi,yi)^J , (2.4) 



8 



where H yk , k = 1, . . . , n, is a family of bounded transition kernels defined on bounded 
measurable functions / : E — > R by: 

H y>k f{x k -i) := E[f(X k )gk(x k -i,y k -i,Xk,yk)\Xk-i = Xk-i] 

f{xk)gk(xk-i,yk-i,Xk,yk)Pk{xk-i,dx k ), (2.5) 



with Xk-i € E. For every x £ E, furthermore, we have 

H yfi f(x) := TT y , f = E lf(X )} = J f(x )fx(dx ). 

It follows, then, from ([2^j) that 

%,fe/ = ^y,k~iHy :k f, k = l,...,n, (2.6) 
so that we finally obtain the recursive expression 

In order to obtain the discrete time approximation of the desired filter ILyj-ps, 
that is needed to solve (P2), we now fix a time discretization grid to = < ■ ■ ■ < 
t n = s in the interval [0, s] and we apply the previous results by working with the 
corresponding quantized process V (we identify X with V and Y with S). 

Before focusing on the discrete time filter approximation, let us make the follow- 
ing remark concerning the conditional law of St given ((V u ) u€ [ s ^, S s ). This will be 
useful to be sure that in our case hypothesis (H) is verified. 

Remark 2.1. Let s <t. Using the form of the solution of the SDE (|1.2p 

S t = S s exp (V04) - \{v 2 (u)+5 2 {u))y u + j\{u)dW u + j\(u)dW^\ , 
we notice that 

C[S t \{V u ) s < u <t,S 8 ) =LN(m Si(;( 7 s 2 t ), (2.7) 

where 



m M = logOS.) + f U{V U ) - \{y\u) + 5\u)) - u(u)^^{)du 
J s V 2 a{u,V u )J 

and 

rt 

5 2 (u)du. 



+ / ~TxF\ u 

a{u,V u ) 



LN(m; a 2 ) stands for the lognormal distribution with mean m and variance a 2 . 

Now, suppose that we temporarily have a time discretization grid from to t: 
no = < u\ < ■ ■ ■ < u m = t. For m large enough we can estimate the mean 
and the variance appearing in Equation (|2.7|) by using an Euler Scheme. When the 
estimations of the mean m s j and variance a 2 t between two discretization steps are 
respectively denoted by mk and a\ we have: 

C(Sk\Vk-i,S k ^i,V k ) =LN(m k ;a 2 ) (2.8) 
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with 

m k = loggfc-i + (v?(Vfc-i) - j- {y 2 {u k -ij + 6 2 (u k -i)) - v(u k -\) b ^ k ~ 1 ' Vh i~ l> ) A fc 
V 2 cr(Mfc-l, Vfc_iJ ' 



H 7 77 V^^' 



°k = 5 2 (u k _i)A k 



where 5 fe := S Uk ; V k := V^ fc ; AV k = V k - V k -i; A k = u k - u fc _ L Recall that the 
density of the lognormal distribution with mean m and variance a 2 , LN(m; a 2 ), reads 



sit 2\ 1 ^7 (log x— m) 



We finally notice, then, that the density of S k given (V k -%, S k -i, V k ) only depends 
on (Vk-i, S k -i, Vk). 

Remark 2.2. In the specific case where 

dV t = fiV t dt + aVtdWt, V = v , 

dS t = fiS t dt + aS t dWt + 5StdW t , S = s , 

we directly deduce from Remark 1 2 . 1 1 that for every s < t 

£{S t \(V u ) s < u < t ,S s ) =LN (^log(^) -ld 2 (t-s);6 2 (t-s)) . 



[> Estimation of the filter. The method is already studied in Pages and Printems 
(2003) and consists first in quantizing for every time step t k the random variable V k 
by considering 

V k = Vro] Vk {V k ), fc = 0,...,n, (2.9) 

where for every k, T k is a grid of N k points vt, i = 1, • • • , N k to be optimally chosen 
and where Projp fc denotes the closest neighbor projection on the grid T k . 
Owing to Equation f|2.6|) our aim is to estimate the filter using an approximation of 
the probability transition P k (v k -i,dv k ) of V k given V k -\. These probability transi- 
tions are approximated by the probability transition matrix p k := (jfi) of V k given 

Pi = n% = v j k \Vk-l = vi_ 1 ), » = 1,-.. ,iV fc _i, j = l,---,N k . (2.10) 

Then, following Equation (|2.5p . by fixing the observation y := (yo,--- ,y k ) (for 
notational simplicity we indicate, as previously in the general case, the observations 
of S by y), we estimate the transition kernel matrix H y k by the quantized transition 
kernel H Vjk given by 

H v,k = ^2 H y,k S v i k _ 1 i i = I)'" ,N k -i, k = 1, 
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where 

and where the ui's, j = 1, • • • ,Nk are the (quadratic) optimal quantizers of We 
also estimate fly,o by 

8=1 

This leads to the following forward induction to estimate %,n : 

TTj/,0 = Hyfi, TTy,k = ^y,k-lH y ^k, k=l,-' ,Tl, ( 2 -H) 

or, equivalently, 



%,o — fly.o 

^^(E^fl^U-i). , „, k=l, 



,71. 



'j=l,-,N k 

Finally, we estimate the desired filter at time t n = s by 



IW = (2.12) 



l> Estimation of the conditional survival probability. Owing to Equation 
(|1.5p we use optimal quantization to estimate the P(inf V u > a\Tg ) on the set 
{r > s} by 

N n 

^F(s,i,<)n^ (2.13) 
i=l 

where u* , i = 1, • • • , N n is the quadratic optimal grid of the process V at time t n = s, 
W y n is the i-th coordinate of the optimal filter ITy^ given in (|2.12|) and, for every 
%, F(s,t,v l n ) is defined as in (|1.6p . Note that this last function has in general no 
explicit expression. In such case, we will estimate it by Monte Carlo as specified in 
the next section. 

3 Approximation by Monte Carlo of survival probabil- 
ities under full information 

The aim of this section is to solve problem (PI), i.e., to compute, for each pair of 
positive values (s,t), s < t < T, 



inf V u > a 

i<U<t 



V s ^j =E(l {Jnf .^ tK>a} |Vi), (3.1) 



where in our general setting the firm value V follows a priori a diffusion of the type 
(jl.ip . Notice that in the specific case where V is a geometric Brownian motion there 
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exists a closed-formula, that we recall below. 
If 

dV t = nV t dt + aV t dW t , V = v , 

then 

\ ( a\ u ~ 2{r - a2/2) 

ml V u >a\V s )=^{h 1 {V s ,t-s))- — $(h 2 (V s ,t - s)) (3.2) 

K s<u<t I \ V s J 

where 

1 f. /x\ ( 1 2 



h 2 (x,u) = —= (log (-) + (fi- \a 2 



1 /' x ' 2 

and where <S>(x) = / e~ u ' 2 du is the cumulative distribution function of the 

standard Gaussian law (see for example Borodin and Salminen, 2002 or Revuz and 
Yor, 1999). 



Since in general we cannot use directly the result in Equation (|3.2p . we have to 
resort to an approximation method. We will adopt the "regular Brownian bridge 
method". However, note that many other methods can be used to estimate these 
probabilities, such as in Kahale (2007), where the crossing probabilities are calculated 
via Schwartz distributions in the specific case of drifted Brownian motion and in 
Linetsky (2004), where the survival probabilities and hitting densities relative to the 
CIR, the CEV and to the OU diffusions are expressed as infinite series of exponential 
densities: 

oo 
71=1 

where < Ai < A2 < • • • < A ra — > 00 as n — ► 00 and (c n ) n are explicitly given 
in terms of the solution of the Sturm-Liouville equation and the eigenvalues of the 
Sturm-Liouville problem. 



In order to find an approximated solution to problem (PI) by means of the re- 
gular Brownian bridge method, we focus on the interval [s,t] and we discretize it by 
means of uq = s < u\ < ••• < t = ujy. We denote by V the continuous Euler scheme 
relative to V. 

The regular Brownian bridge method is connected to the knowledge of the distri- 
bution of the minimum (or the maximum) of the continuous Euler scheme V relative 
to the process V over the time interval [s,t], given its values at the discrete time 
observation points s = uq < u\ < ■ ■ ■ < un = t (see, e.g., Pages, 2008). 

Lemma 3.1. 

C( min V u \V Uk =v k ,k = 0,--- ,n)=c( min (U h j) (3.4) 
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where (f7fc)fc=0,— iV-l are i-i-d random variables uniformly distributed over the unit 
interval and G~^ y is the inverse function of the conditional distribution function G Xjy , 
defined by 

( 2N \ 

G x , y {u) = exp y - - ^ a 2^ ( U ~ ~ y)J IL {min(x, 3/ )>«}- 

Notice that we have omitted the dependence on time in a. 

Recall that for a given time discretization grid Uk '■= s + N , k = 0, ■ • ■ , N, on 
the set [s,t], the continuous Euler scheme relative to the process V is defined by 

V u = Vu + b(u, Vu){u -u) + a(u, Vu){W u - wy , V s = v s , 

with u = Uk if u £ [iifc, itjfc+i). We deduce from the previous lemma the following 
result. 



Proposition 3.2. 



mm t K>a|^)=E^nG^^ +i (a)|^ 



Proof. We have (recall that V s = V % 



no ; 



min K > a y s = E P min V u > a\V Uk ,k = 0,...,N\ \ V, 

s<u<t I V \ s<u<t 



E I ^{mino^^iv V N >a} eX P (, ~ 1=1 2^=0 )\ V sJ 



\ fc=0 / 
\fc=0 / 

which gives the announced result. □ 
By using Proposition [372], we estimate the survival probability under full information 

P ( inf V u > a\V s = v 

\s<u<t 1 

by the following Monte-Carlo procedure: 

• Time grid specification. Fix uq = s < u\ < ■ ■ ■ < t = un , the set of N + 1 
points for the (discrete time) Euler scheme in the interval [s,t]; 

• Trajectories simulation. Starting from v and having fixed M (number of Monte 
Carlo simulations), for j = 1, . . . , M, simulate the discrete path (V£ k ) k=0 N ; 

• Computation of the survival probability. For j = 1,...,M, compute (recall 
that, for every j, V£ Q = v) 

N-l 

Ps,t( v '> a ) := TT yi (a). (3.5) 

k=0 
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Monte Carlo procedure. Finally, apply the Monte-Carlo paradigm and get the 
following approximating value 



inf V u > a 

s<u<t 



V,=V 



M 



(3.6) 



As a consequence, combining formulas (|2.13p and (|3.6p leads to the following 
hybrid Monte Carlo - optimal quantization formula on the set {r > s} 



inf V u > a 

s<u<t 



\ , M N n 

/ j=l i=l 

where pl t (- ;a) was introduced in (|3,5p . 



(3.7) 



4 The error analysis 



We now focus on the error done by approximating P I inf V u > a 

\ s<u<t 

M N: 



by 



1 ^ 



j=l i=0 

We distinguish three types of error. The first error is induced by the approximation 
of the filter Hy n appearing in Equation (|2.2p by tl y ^ n , defined in (|2,12p . The second 
one is the error deriving from the approximation of 



inf V u > a \V S = v ] by 

;<u<t 1 



inf V u > a \V S = v 

s<u<t 



where V is the (continuous) Euler scheme relative to the process V. The last one 
is the error arising from the approximation of the survival probability under full 
information by means of Monte Carlo simulation. 

Note that in the case when Equation (jl.ip admits an explicit solution, as in the 
Black-Scholes model, there is no need to use an Euler scheme, so that the second 
kind of error has no more to be taken into account. 



[> Filter approximation error. In order to have some upper bound of the quanti- 
zation error estimate of Hy in F(s, t, •) let us make the following assumptions, as done 
in Pages and Pham (2005). Notice that they are stated in the more general setting 
of Section \2.2\ so that we have to identify V with X and S with Y . 



(Al) The transition operators Pk(x,dy) of given Xk-i, k 
chitz. 



1, 



, n are Lips- 



Recall that a probability transition P on E is C-Lipschitz (with C > 0) if for 
any Lipschitz function / on E with ratio [f]hip, Pf is Lipschitz with ratio [P/]lm> < 
C[f]u P - Then, one may define the Lipschitz ratio [-P]_Li p by 



[P] 



Lip 



sup { L I) LlV i f a nonzero Lipschitz function | < +oo. 
1 UlLip > 
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If the transition operators P k (x, dy), k = 1, ■ ■ ■ , n are Lipschitz, it follows that 

[P]u P ■= max [Pk\Li P < +oo- 
k=l,— ,n 

(A2) (i) For every k = l,--- , n, the functions g k (recall hypothesis (H)) are 
bounded on E x W x E x R 9 and we set 

K" := max ||p fc ||oo- 

a fc=l,— ,n 

(ii) For every k = 1, • • • , n, there exist two positive functions [fi^Lip and 
[sIl-Lip defined on R q x R 9 so that for every x, x', x,x' £ E and y, y' G IR 9 , 

\g k (x,y,x',y')-g k (x, ,y')\ < [gl]Li P (y,y') \x-x\ + [gl] Lip (y,y') \x'-x'\. 

The following result gives the error bound of the estimation of the filter (see Pages 
and Pham, 2005, for details of the proof). 

Theorem 4.1. Suppose that Assumptions (Al) and (A2) hold true. For eve- 
ry bounded Lipschitz function F on E and for every n-tuple of observations y = 
(yi, ■ ■ ■ ,y n ), we have for every p>l, 

K n n 

(pn{y)V<t>n(y) k=0 

with 

4>n{y) ■= TTj/^l, <fin(y) ■= %,nl 

and 

B%(F,y,p) := (2 - 5 2 ^)[P] n L ^[F] Lip + 2 (^^([gl +1 } Lip (y k ,y k+1 ) + [g 2 k ] L ip(y k -i, 

+ ( 2 -5 2 J^ [PYup^ii^Lipiyj-uyj) + [P}Lip[g]]Lip{yj-i, 
9 j=k+i 

(Convention: go = g n +i = and S n , P is the usual Kronecker symbol). 

> Error induced by the Euler scheme. We here refer to Gobet (1998), in which 
the author starts by investigating the case of a one-dimensional diffusion and to the 
successive related article Gobet (2000) for the multidimensional case. In the two 
papers the considered diffusion has homogeneous coefficients b and a. We start by 
recalling here some important convergence results we find therein, we will then adapt 
these results to our case. 

Suppose that X is a diffusion taking values in R, with Xq = x, and define r' as 
the first exit time from an open set Del: 

t' :=inf{n>0: X u D} . 

Let t' c denotes the exit time from the domain D of the continuous Euler process 
X. In order to give the error bound in the approximation of K x (l| T / >t }/(Xf)) by 
E-c (l{ T( / >t }/(Xt)) the following hypotheses are needed: 
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function and a is in C£°( 



(HI) 6isaC 6 c 

(H2) there exists oq > such that Vx € M, a(x) 2 > <Tq (uniform ellipticity) , 
(H3) P x (inf te[0)T] X* = a) = 0. 

The following proposition states that, under hypothesis (H3), the approximation 
error goes to zero as the number of time discretization steps goes to infinity. 

Proposition 4.2 (Convergence). Suppose that b and a are Lipschitz, D = (a, +oo) 
and that (H3) holds. If f G C%(D,R) then, 



lim 



E x [l {r , >T} f(X T )} - E x [l w>T} f(X T )} 



0. 



Note that in the homogeneous case, when D = (a, +oo), a sufficient condition in 
order for (H3) to hold is 

a(X ) ± 0. (4.2) 
On the other hand, the rate of convergence is given by the following 

Proposition 4.3 (Rate of convergence). Under hypotheses (HI) and (H2), if 
f £ Cl(-D,R), then there exists an increasing function K(T) such that 



E x [l {T , >T} f(X T )]-E x [l {T , >T} f(X T )] 



< 



(i) 



K(T)\\f\\^ 



where 



(i) 

D 



Ej=o su PzeD l-f 0) O)l- 



Remark 4.4. One notes that the two previous propositions still hold when the 
diffusion coefficients are in- homogeneous, as in our general setting, by replacing 
Hypotheses (HI), (H2), (H3) by (I) and (J) : 

(I) b and a are C£° functions with respect to both arguments t and v, with uni- 
formly bounded partial derivatives with respect to v, 

(J) a is uniformly elliptic, i.e., 3a > such that a 2 (t,v) > a,\/(t,v) £ [0, T] x R 
and <r(0, vq) / 0. 



t> Error induced by Monte Carlo approximation. This error comes from the 
estimation of P (min s < u <t V u > a\V s = v l s ) = E (jlk=o G v Uk ,v Uk+1 i. a )\V s = v{\ , for 
every % = 1, • • • , N s , by 

E^ipj,t( u -;°) 



M 

where pi t (- ; a) is defined in (|3.5p . We have for every i = 1, • • • ,N S , 



JV-l 

E (nv. w (* =, « 

fc=0 



Ej=iJ»j,t(«5;a) 



M 



= o 

2 WM 



(4.3) 
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By adapting the previous results to our case, namely by identifying V with X 
and S with Y, one deduces an error bound for the estimation of H yin F(s, t, •) by 
n yin F MJV (s, t, x), where F MN (s,t,x) is a Monte Carlo estimation of F(s,t,-) of size 
M, based on a time discretization grid, between s and t, of size N. 

Theorem 4.5. Suppose that the Assumptions of Theorem \4-l\ and Proposition \4-3\ 
hold. Then 



\U y n F(s,t,-) -U y n F MH (s,t,-)\ < 



K" 



<t>n{y) v <t> n {y) 



^TBUF(s,t,-),y,p) \\Vk-Vkl 



fc=0 



+ O 



1 



+ o 



<M 



where n is the dimension of the observation vector y, N stands for the size of the time 
discretization grid for the Euler scheme from s to t and M is the number of Monte 
Carlo trials. Furthermore, Kg, 4> n (y), <^n{y) and B%, k = 0, . . . , n, are introduced in 
Theorem \4-l\ 



Proof. We have 

\U ytn F(s,t,-)-U yjn F MN {s,t,-)\ < \U y:n F(s,t r ) -Il y>n F{s,t,-)\ 

+ \Uy !n F(s,t, •) - IL y>n F MN (s,t, -)|. 

The error bound of the first term on the right-hand side of the above inequality is 
given by Theorem 14.11 As concerns the last term we have 

\U y>n F( S ,t,-)-U y , n F MN (s,t,-)\ = \J2Kn( F ( s ^' vi s) ~ F MM{s,t,vD) 



i=l 



< swp\F(s,t,v) - F MN (s,t,v)\ 



yji 



1=1 



sup|F(s,t,w) - F MN (s,t,v)\. 



On the other hand, we have for every v £ 



N-l 



\F{s,t,v) - F MN (s,t,v)\ < F v (T>t)-E v [Y[ G Vu k ,Vn k+l ( a ) 



k=0 



N-l 



+ 



F ( TT r ^f=^M a ) 
E A 11 G v Uk ,v Uk+1 

k=0 



M 



We then deduce from Proposition 14.31 and from Equation (|4.3p that 



\F{s,t,v)-F MN (s,t,v)\ < O 
which completes the proof. 



+ o 



□ 
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Remark 4.6. (About the hypotheses of Theorem I4.5P We consider the case when 
V is a time homogeneous diffusion. 

> It is straightforward, by using Remark 12.11 to prove that Assumption (A2) is 
fulfilled. 

> If we suppose that the coefficients b and a of the diffusion V are Lipschitz, we 
show, by using the Euler scheme relative to V, that Hypothesis (Al) holds true. 

> As concerns the Lipschitz property of the function F(s, t, •), it follows from Propo- 
sition 2.2.1 in Gobet (1998), in the case when the coefficients of the diffusion satisfy 
Hypotheses (HI) and (H2) and for t > s. 



5 Numerical results 

In the numerical experiments we deal with the estimation of the credit spread for 
zero coupon bonds. We fix s and, given the observations of S from to s, we estimate 
the spread curve for different maturities t (t > s). 

The credit spread is the difference in yield between a corporate bond and a risk- 
less bond (treasury bond) with the same characteristics. It can be seen as a measure 
of the riskiness relative to a corporate bond, with respect to a risk-free bond. If we 
suppose for simplicity that the face value is equal to 1 and the recovery rate is zero, 
the credit spread under partial information from time s to maturity t, S(s, t), equals 
(see Bielecki and Rutkowski, 2004 and Coculescu, Geman and Jeanblanc, 2008) 

„, A log Q(inf 

s<u<t 

S{s,t) = , 

t — s 

where Q is a martingale measure equivalent to P. We suppose that the market is 
complete, so that Q is unique. 

We will consider two models for the dynamics of the firm value V: the Black- 
Scholes one and a CEV (Constant Elasticity of Variance) model. In both cases we 
fix s = 1 and we estimate the spreads, given the observed trajectory of S from to s, 
for different maturities varying 0.1 by 0.1 from 1.1 to 11 (the time unit is expressed, 
e.g., in years). We set the number n of discretization points over [0, s] equal to 50 
and for every k = 1, • • • , n, the quantization grid size Nk is set to 60, with Nq = 1. 
The number of Euler discretization steps N equals 50 for t varying 0.1 by 0.1 from 
1.1 to 3.0 and N = 100 for t varying 0.1 by 0.1 from 3.1 to 11.0. The number of 
Monte Carlo trials M is set to 300000. In the quantization phase we obtain the 
optimal grid by carrying out 80 Lloyd's I procedures. 

> The Black-Scholes model. We consider the following model for the firm value's 
and the observed process' dynamics: 

dV t = V t {rdt + adW t ), V = v , , . 

dS t = St(rdt + adW t + 5dW t ), S = v , 1 1 

with r = 0.03, a = 0.05, vq = 86.3. The barrier a is fixed to 76 and 5 = 0.1. We 
have, in particular, 

dS t dV t , . 

— = — + 5dW t , (5.2) 

meaning that the return on S is the return on V affected by a noise. 
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Remark 5.1. a) Since V is not traded in the market, the return on V is not 
necessarily equal to the interest rate r. We have set the return on V to be equal to 
r in order to have a good economic interpretation of the model, 
b) Note that in this particular case all the hypotheses of Theorem 14.51 are satisfied. 

Numerical results are presented in Figures Q] and [2j Figure Q] is relative to the 
partial information case, so that three trajectories of the observed process S and the 
corresponding credit spreads are depicted. Figure [2j on the contrary, concerns the 
full information case, namely we suppose that we can observe different trajectories 
of V, that are drawn on the left, and we compute the corresponding credit spreads, 
that are shown on the right. 

Remark that in this setting we deduce from (|5,ip that 

S t = V t e~¥ 2 t+Wt. 
The correlation coefficient is then given for every t by 



p{t) :- 



,(<T 2 +S 2 )t _ I 



meaning that the firm value V is positively correlated to the observation process S. 
Observe that when a < 5, p(t) is a strictly decreasing function and goes to as t goes 
to infinity. This tell us that the a posteriori information on V given S decreases as 
the maturity t increases. This is what we observe in the spreads curves from Figures 
Q]and El since for large maturities the spreads values almost coincide for analogous 
trajectories. 




First of all, we notice that the short term spreads under partial information, 
being the default time totally inaccessible, do not vanish, as it is the case in the full 
information model. Moreover, since Vt and St are positively correlated, it is expected 
that the more the trajectory of S behaves "badly" , the higher the short term spreads 
are, as shown in Figure [Tj 



19 




Figure 2: Three trajectories of the value process in the full information case (on the left) 
and the corresponding spreads (on the right). 



In the full information setting the short term spreads are always equal to zero, 
but in "bad" situations (for example in the case of the blue trajectory on the left- 
hand side of Figure [2]) the medium term spreads can be higher than in the partial 
information model. 

The credit spreads under partial information can, then, be considered as good 
indicators of the transparency of a firm as viewed by bond-market participants. 

[> The CEV model. We suppose that the firm value's and the observed process' 
dynamics are given by 

f dV t = Vt(fJLdt + jVfdWt), V = v , (5 ) 

\ dS t = St(rdt + adW t + 5dW t ), S = v , { ' } 

where \x = r = 0.03, 7 = 744.7 (it is chosen so that the initial volatility equals 0.10), 
(3 = — 2 (notice that in this case one of the characteristics of the model is that the 
leverage effect holds: a firm value process increase implies a decrease in the variance 
of the price process return), a = 0.05, 8 = 0.1, vo = 86.3. 

Remark 5.2. Since in this situation the firm value process reaches zero with a 
positive probability (see, e.g., Jeanblanc and Yor, 2009), we have to work with the 
stopped process (V T /\t) t - 

Numerical results are presented in Figure [3l where three trajectories of the ob- 
served process S and the corresponding spreads are depicted. We first notice that 
the spreads in this example are higher than the ones in the previous example. This 
is due to the fact that in this case the observed process S is more volatile, as it can 
be seen from Figure compared to Figure HJ 

Secondly, we remark, as in the previous example, that the more the trajectory 
of S behaves "badly" , the higher the short term spreads are, as shown in Figure El 

Moreover, notice that the spread curves corresponding to the two worst S trajec- 
tories seem to cross, however a zoom in the graph shows that it is not the case and 
that the blue curve is always above the red one. This can be explained by noticing 
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that the model we use keeps the memory of all the observed path and that the blue 
trajectory of S is globally worse than the red one. 




SDCEV SPREADS for SO CEV 



Figure 3: Three trajectories of the observed process S in the CEV model (on the left) and 
the corresponding spreads (on the right). 



We end this section by pointing out some general facts concerning the methodo- 
logy we have used. 

Remark 5.3. a) The most important fact from the numerical point of view is 
that, as soon as the process V is quantized over [0, s], the survival probability 
Q(inf s < tt < i V u > a|.F,f ) is estimated for every maturity t > s without modifying 
the optimal quantization grid of V. 

b) Notice also that this method works as soon as the process V can be quantized. 

c) Dealing with the computation of credit spread we have to work under the prob- 
ability Q. In the case where S is traded in the market the drift under the measure 
Q is equal to r and we do not need to know the function t/j in order to estimate the 
survival probability. When the observed process is not traded in the market, on the 
contrary, and if we are not interested in the computation of the credit spreads, the 
survival probability can be computed under the historical probability P, so that the 
function t/j is exploited. 
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